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Abstract 

We study the problem of type inference for a family of polymorphic type disciplines con¬ 
taining the power of Core-ML. This family comprises all levels of the stratification of the 
second-order lambda-calculus by “rank” of types. We show that typability is an undecidable 
problem at every rank k > 3 of this stratification. While it was already known that typability 
is decidable at rank < 2, no direct and easy-to-implement algorithm was available. To design 
such an algorithm, we develop a new notion of reduction and show how to use it to reduce the 
problem of typability at rank 2 to the problem of acyclic semi-unification. A by-product of our 
analysis is the publication of a simple solution procedure for acyclic semi-unification. 
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1 Introduction 


Background and Motivation. Modern type systems for functional programming languages 
use polymorphic type inference. Type inference for untyped or partially typed programs saves 
the programmer the work of specifying the type of every identifier. Polymorphism lets the pro¬ 
grammer write polymorphic functions that work uniformly on arguments of different types and 
avoids the maintenance problem that results from duplicating similar code at different types. 
The first programming language to use polymorphic type inference was the functional language 
ML [GMW79, Mil85]. Due to its usefulness, many of the aspects of ML have been subsequently 
incorporated in other languages (e.g. Miranda [Tur85]). ML shares with Algol 68 properties of 
compile-time type checking, strong typing and higher-order functions while also providing type 
inference and polymorphism. 

The usefulness of a particular polymorphic type system depends very much on how feasible the 
tasks of type checking and type inference are. We define these concepts in terms of the untyped 
A-calculus, which we use as our pure functional programming language throughout this paper. By 
type checking we mean the problem of deciding, given a A-term M and a type r, whether r is one 
of the types that may be derived for M by the type system under consideration. By type inference 
we mean the problem of finding a type derivable for a A-term in the type system. The problem of 
type inference involves several issues: 

(1) Is typability decidable, i.e. is it decidable whether any type at all is derivable for a 
A-term in the type system? 

If typability is undecidable, then there is little more to say in relation to type inference. (Although 
a programming language may work around this problem by asking the programmer to supply 
part of the type information and by using heuristics, we will omit discussion of this possibility.) 
Otherwise, if typability is decidable, then it is possible to construct a type for typable A-terms, i.e. 
type inference can be performed, in which case we further ask: 

(2) How efficiently can typability be decided? How efficiently can type inference be 
performed? 

(3) Can a principal type (a “most general” type) be constructed for typable A-terms? 

The answers to these questions determine how feasible the type system is to implement. 

In addition to the feasibility of a particular polymorphic type system, its usefulness also depends 
on how much flexibility the type system gives the programmer. Although the polymorphism of ML 
is useful, it is too weak to assign types to some program phrases that are natural for programmers 
to write. To overcome these limitations researchers have investigated the feasibility of type systems 
whose typing power is a superset of that of ML. Over the years, this line of research has dealt 
with various polymorphic type systems for functional languages and A-calculi, in particular the 
powerful type system of the Girard/Reynolds second-order A-calculus [Gir72, Rey74], which we 
will call by its other name, System F. In the long quest to settle the type checking and typability 
problems for F, researchers have also considered the problem for F modified by various restrictions. 
Multiple stratifications of F have been proposed, e.g. by depth of bound type variable from binding 
quantifier in [GRDR91] and by limiting the number of generations of instantiation of quantifiers 
themselves introduced by instantiation in [Lei91]. One natural restriction which we consider in 
this paper results from stratifying F according to the “rank” of types allowed in the typing of 
A-terms and restricting the rank to various finite values (introduced in [Lei83] and further studied 
in [McC84, KT92]). All of these systems improve on the expressive power of ML. 

Unfortunately, it is often the case that the more flexible and powerful a particular polymorphic 
type system is, the more likely that it will be infeasible to implement. As discouraging examples, 
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the problems of typability and type checking for many of the polymorphic type systems mentioned 
above have recently been proven undecidable. Type checking and typability were shown to be 
undecidable for System F (cf. recent results submitted for publication elsewhere in [Wel93]) and 
for its very powerful extension, System F u [Urz93]. Other related systems that are not exactly 
extensions of ML have also recently been proven to have undecidable typability, i.e. System F< 
which relates to object-oriented languages [Pie92], and the All-calculus which relates to extensions 
of A-Prolog [Dow93]. 

Against this recent background, it is desirable to demarcate precisely where the boundary 
between decidable and undecidable typability lies within various stratifications of System F. In the 
case of decidable typability, it is also desirable to develop simple and easy-to-implement algorithms 
for the most powerful level within a stratification that is also feasible to use. We undertake this 
task for the stratification of System F by rank of types. 

Contributions of This Paper. We can now firmly establish the boundary for decidability of 
typability and type checking within the stratification of System F by rank of types. The two 
problems are undecidable for every fragment of F of rank > 3 and are decidable for rank < 2. 
The undecidability of type checking at rank > 3 can be seen by observing that the proof for the 
undecidability of type checking in F in [Wel93] requires only rank-3 types. 1 The undecidability 
of typability at rank > 3 results from the fact that the constants c and / defined in section 5 
of [KT92] can be encoded using the methods of [Wel93] in System A 3 (the rank-3 fragment of F) 
and from Theorem 30 of [KT92]. We give this encoding in this paper. Since it was already known 
from [KT92] that typability is decidable for System A 2 (the rank-2 fragment of F), we know exactly 
where the boundary of decidability for typability lies. These circumstances lead us to look for a 
simple and direct algorithm for type inference within A 2 . 

The existing proof that typability is decidable for System A 2 uses a succession of several re¬ 
ductions to the typability problem in ML and results in a type inference algorithm that is neither 
simple nor easy to understand. In this paper, we give a simpler and more direct algorithm for 
the decidable case of typability in A 2 . We first prove that A 2 is equivalent to a restriction named 
System AJj"’* having many convenient properties. We then develop a notion of reduction named 
6 which converts A-terms into a form which is more amenable to type inference but which also 
preserves every A-term’s set of derivable types in AJf’*. The type inference problem in AJf’* for a 
A-term in 0-normal form is easily converted into an acyclic semi-unification problem. Finally, we 
give a simple algorithm for solving acyclic semi-unification problem. The complexity of the whole 
procedure is the same as that of type inference in ML. 

We omit all proofs of all lemmas and theorems in this conference report to remain within the page 
limit. A later extended version of this paper will clarify the relationship between ML-typability 
and typability in A 2 and discuss the issues of type checking and principal types in A 2 . 

Acknowledgements. A number of definitions used in this paper were lifted from [KT92, KTU90, 
KTU93]. 

2 System and System A^ 

In this section, we define first the untyped A-calculus, then System F, then the restriction of System 
F that results in System A k . Then, we define a restriction of System A 2 called System Aj" which 
has equivalent typing power. We use the “Curry view” of type systems for the A-calculus, in which 
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pure terms of the A-calculus are assigned types, rather than the “Church view” where terms and 
types are defined simultaneously to produce typed terms. 

The set of all A-terms A is built from the set of A-term variables V using application and 
abstraction as specified by the usual grammar A ::= V | (A A) | (AV.A). We use small Roman 
letters towards the end of the alphabet as metavariables ranging over V and capital Roman letters 
as metavariables ranging over A. When writing A-terms, application associates to the left so that 
MNP = ( MN)P. The scope of “Xx.” extends as far to the right as possible, while the scope of 
“As” without the covers as little as possible. 

As usual, FV(Itf) and BV(Itf) denote the free and bound variables of a A-term M. By M[x:=N] 
we mean the result of substituting N for all free occurrences of x, renaming bound variables in 
M to avoid capturing free variables of N. We will sometimes use this substitution notation on 
subterms when we intend free variables to be captured; we will distinguish this intention by the 
proper use of parentheses, e.g. in A x.(N[y:=x]) we intend for the substituted occurrences of x to 
be captured by the binding. A context C\-] is a A-term with a hole and if M is a A-term then 
C[M] denotes the result of inserting M into the hole in C[ ■ ], including the capture of free variables 
in M by the bound variables of C[-\. We denote that N is a subterm of M (possibly M itself) 
by N C M. We assume at all times that every A-term M obeys the restriction that no variable is 
bound more than once and no variable occurs both bound and free in M. K denotes the standard 
combinator (Xx.Xy.x). 

The set of all types T is built from the set of type variables V using two type constructors 
specified by the grammar T ::= V | (T—>T) | (VV.T). We use small Greek letters from the 
beginning of the alphabet (e.g. a and (3) as metavariables over V and small Greek letters towards 
the end of the alphabet (e.g. a and r) as metavariables over T. When writing types, the arrows 
associate to the right so that a^r^p = ct—>-(t—> p). We use the same scoping convention for 
“V” as we do for “A”. FTV(r) and BTV(r) denote the free and bound type variables of type r, 
respectively. We give the notation a\a:=r] the same meaning for types that it has for A-terms. We 
write a <t to indicate that a can be instantiated to r, i.e. a = Ma.p and there exist types x such 
that p[a:=x] = r. A 0 denotes that the types y in the substitution contain no quantifiers. We write 
T to denote the type Va.a. 

We have several conventions about how quantifiers in types are treated, ci-conversion of types 
and reordering of adjacent quantifiers is allowed at any time. For example, we consider the types 
Va.V/Ta —► (3, V/3.Va./3 —► a, and V/TVa.a —► /3 to all be equal. Using ci-conversion we assume that 
no variable is bound more than once in any type, that the bound type variables of any two type 
instances are disjoint, and that all bound type variables of any type instance are disjoint from the 
free type variables of another type instance. If a = Vci.r and a £ FTV(r), we say that “Vci” is a 
redundant quantifier. We assume types do not contain redundant quantifiers. 

We define a notation for specifying many quantifiers concisely. For type a and set of type 
variables X C FTV(cr), the shorthand notation MX.a is defined so that V0.cr = a and V(X U 
{a}).a = MaX(X - {a}).cr. This defines just one type because we assume the order of quantifiers 
does not distinguish two types. We may use a to stand for a sequence of type variables ay, ..., 
a n . We allow a to be treated as a set or as a comma-separated sequence as is most convenient, so 
Va.CT has the expected meaning. The notation V.<7 means V(FTV(<t)).<7. 

To define System A k , we will use the following inductive stratification of types. First define 
R( 0) as the set of open types, i.e. types not mentioning V. Then, for all k > 0, define R(k + 1) by 
the grammar R(k + 1) ::= R(k) | ( R(k)—>R(k + 1)) | (V V. R(k + 1)). We say that R(k) is the 
set of types of rank k. For example, Ma.a^M is a type within rank 1 and {Ma.a^a)^M(i.fi 
is a type within rank 2 but not within rank 1. Our definition of rank is equivalent to the notion of 
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VAR 

A h x : a 


A{x) = a 

APP 

AhM: , A h A 

' : a 


A h (M N) : t 



ABS 

A U { x:a } h M : t 

A h (A x.M) : 



INST 

AhM: Ma.a 

AhM: a[ct:=T ] 



GEN 

Ah M : a 

AhM: Met.a 


a i FTV(A) 


Figure 1: Inference Rules of System F and A*. 


rank introduced in [Lei83]. Since R(k) C R(k + 1) it follows that if a type a is within rank k, then 
it is within every rank n > k. Observe that performing the substitution a\a:=r] may not preserve 
rank. The resulting rank depends on the rank of r and how deep in the negative scope of —► the 
free occurrences of a in a are. 

To define AJf, we will use subsets of the type sets R( 0), R( 1), and R( 2) called A(0), A(l), and 
S(2). Let A(0) = R( 0) be the set of all open types. Let A(l) be the set of all types of the form 
Va.oy where a £ A(0). Let A(2) be the set of all types of the form Va.oy—► • • where 

ay,... , a m £ A( 1) and r £ A(0). 

An assertion is an expression of the form A h M : r where A is a type assignment (a finite set 
{xi : oy,... ,x n : a n } associating at most one type a with each variable x), M a A-term and r a 
type. We say this assertion’s type is the type oy—► • • -^a n ^r and an assertion’s rank is the rank 
of its type. An assertion A h M : r is within rank 2 if and only if r is within rank 2 and all the 
types assigned by A are within rank 1. A(x) denotes the unique type a such that that (x:a) £ A. 
FTV(A) is the set of all free type variables in all of the types assigned by A. The notation A[a:=x] 
denotes a new type assignment A' such that if A(x) = a then B(x) = cr[a:=x\. We assume that 
throughout an assertion it is the case that all bound type variables are named distinctly from each 
other and that the bound and free type variables do not overlap (satisfied by ci-conversion). 

We define System F to be the type system that can derive types for A-terms using the inference 
rules presented in Figure 1 with no other restrictions. For every k > 0, we define A k as the 
restriction of F which allows only assertions within rank < k to be derived. We define System 
Ajf as a restriction of System A 2 where the two differences are that (i) in Ajf all assertions must 


INST - 


AhM: Vci.a 
A b M : a[a:=r] 


r £ S(0) 


Figure 2: INST : Replacement for INST in A 2 . 
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have types in 5(2) (thus all assigned types are in 5(1) and all derived types in 5(2)) and (ii) that 
the inference rule INST of A 2 is replaced by the rule INST - described in Figure 2. Theorem 9 
in [KT92] shows that Af types the same set of terms as A 2 with very similar types. Since Af is 
as powerful as A 2 and since its restrictions make analysis of type inference easier, we will use it 
instead of A 2 in this paper. 

If /C is a type inference system, then the notation A h K M : r denotes the claim that A h M : r 
is derivable in 1C. 

3 System Typability Undecidable for k > 3 

Section 5 of [KT92] introduces System A k [C k ] for each k > 3 and Theorem 30 of the same pa¬ 
per proves that typability is undecidable for A k [C k \ for k > 3. The original definition of A k [C k \ 
defined it based on A k by adding two constants, c and /, with predefined types <f> Ctk and <f>f tk . 
A simple alternate definition is to declare that ACM : r is derivable in A k [C k \ if and only if 
4U {c:f> c k , f:f>f k } h M : r is derivable in A k . 

The analysis for A 3 [C 3 ] goes as fohows. For A 3 [C 3 ] the types of the constants c and / are 
4>c,3 = V ex.ex ^(((o ecx) eex) eex) and ej>^ 3 = \/cx.(cx^cx)^((cx^cx)^cx). We construct a context 
C 3 [ ■ ] with one hole: 

/,[•] EE (.7-(^ i2 ,,-(^.-^-))))(Aa:,-.Ka:,-(K(a: i ( a :,-r))[ - ]))(Aw.-.w,-w,-) 

D[-] = (Xf.r(x 1 (fx 1 x 1 ))(x 2 (fx 2 x 2 ))[-])(Xu.Xv.u(v(u(ur)))) 

E[ ■ ] = (A t.r(x 1 (tx 1 (x 1 r)(fx 1 )))(x 2 (tx 2 (x 2 r)(fx 2 )))[ ■])(Xp.Xq.Xs.K(p(pq))(p(sp))) 

GH ee (Xc.r(xi(c(xir)(fxi)))(x 2 (c(x 2 r)(fx 2 )))[-])(tr) 

Cs[-] = X r..h[J 2 [D[E[G[-]]]]] 

Using the methods of [Wel93], it can be seen that this context can be typed in A 3 and in any typing 
of this context (with any A-term placed in the hole), the variables c and / must be assigned the 
types cf ) Ct3 and <f>f t 3 . 

Since for each k > 3 a context C k [-] having the same properties with respect to A k can be 
constructed, we have this result: 

Theorem 3.1 For any type assignment A, there exists a type t such that 4u{c:<) C)i , f:(j)f k }\- A3 M : 
t is derivable if and only if there exists a type t' such that A\~ A3 C k [M] : t' is derivable. Thus, the 
problem of typability for A k [C k \ for k > 3 is reducible to the problem for A k . Therefore, typability 
is undecidable for A k for every k > 3. 

4 System * 

In this section, we observe a number of convenient properties of System A^". We then define System 
AJU* as a restriction of Af that embodies these properties and prove that AJu* is equivalent to Af. 

Definition 4.1 (act) (Taken from [KT92].) Let us define, by induction on A-terms M , the se¬ 
quence act(M), of active variables in M: 

1. act(x) = e (the empty sequence) 

2. act(Xx.M) = x ■ act(M) 


3. act(MN) 


if act(M) = e 

if act(M) = Xi ■ ■ ■ x n , for some n > 1 









Let us observe that due to our conventions, there are no repetitions of variables in act(M). 
The sequence act(M) represents outstanding abstractions in iff, i.e. those abstractions which have 
not been “captured” by an application. For each application subterm Q = RS in a A-term M 
where act(R) = x ■ ■ ■, there is an abstraction subterm N = (A x.P) within R (possibly R itself). In 
this case, we say that the subterms N, Q, and S are companions, specifically, N is the companion 
abstraction, Q the companion application, and S the companion argument of the others. In this 
case, if N is ever /3-reduced, its argument will be S or 5’s /3-descendent. If TV = R, i.e. Q = NS, 
then we say that they are adjacent companions and it is the case that they are a (3- redex. A 
set of non-adjacent companions represents a “potential” (3 -redex in a A-term whose presence can 
be detected by simple inspection without /3-reduction. Companions turn out to have convenient 
properties in Af. 


Definition 4.2 (( ) A ) For a A-term M, we define (M) x as the effect of traversing M and labeling 
each of its abstraction subterms with an index i £ {1,2,3}, depending on the subterm’s position 
and whether it has companions. ( M) x is defined in terms of an auxiliary function label which takes 
as parameters a A-term, a set of variables, and an index. The inductive definition of label follows 
for i £ {1,2,3}: 

1. label(x, X, i) = x 


2. label((Xx.M), X, i) 


' {Xx.label{M,X,ij) ifxeX, 
( Xx.label{M,X,i)) if x f X 


3. label((MN), X, i) = ( label(M, X, i) ■ label(N, act(N),3 )) 

We then finish the definition by saying that ( M) x = label(M,act(M), 2). 


Informally, labeling the A-term M affects each abstraction subterm N as follows. If N has 
companions, then it is labelled with A 1 . If iV does not have companions, then it is labelled with A 2 
if there is no subterm P = LR of M such that N lies within R, the right subterm. Otherwise N is 
labelled with A 3 . When dealing with a labelled A-term M after this point, we will assume that the 
labeling is the result of the ( ) A operator and not any arbitrary labeling, i.e. we assume that either 
M = (N) x or M C (N) x for some unlabelled A-term N. 


Lemma 4.3 If V is a derivation in Af that types the X-term M, and there is an abstraction 
subterm (A x.N) in M, and there is a subterm ( PQ) in M such that x appears in act(Q), and there 
is an assertion A U {x:a} b N : t in V, then a £ 5(0). Restated more informally, the bound 
variable of a companionless, X 3 -labelled abstraction must be assigned a monomorphic type. 


Lemma 4.4 If in Af there is a derivation V ending with the assertion A b M : t, then for 
any type variable substitution [o:=y], it is the case that there is a derivation V ending with the 
assertion A[a:=x] b M : t[q::=x] and, furthermore, V and V are of the same length and there is 
a one-to-one correspondence between rule applications in both derivations. 


Lemma 4.4 is used by Lemma 4.5. For Lemma 4.5, let us temporarily suppose that quantifiers 
introduced into types by the GEN rule are marked. For example, from the assertion A b M : r 
where a ft. FTV(A) we can derive using GEN the assertion A b M : V b a.r. These markers on 
quantifiers do not affect the behavior of the inference rules; they merely allow us to precisely 
phrase the lemma. 


Lemma 4.5 If in Af there is a derivation V ending with the assertion A \~ M : t, then there is 
a derivation V ending with the same assertion such that there is no use of the INST rule whose 
premise is an assertion of the form B b N : M^a.p. In plainer English, we can assume that 
guantifiers introduced by GEN are never instantiated. 
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VAR* 

Ah x : Va.r 

APP* 

Ah M : a^T, Ah N : a 

A h (M N) : Va.r 

APP * + 

Ah M : a—>T, Ah N : a 

A h (M N) : Va.r 

ABS*’ 1 ’ 2 

A U {x\(j} h M : t 

A h (X'x.M) : Va.a^r 

ABS*' 3 

AU{x:a}h M :t 

Ah (A 3 x.M) : yd.a—^r 


A(x)A°t , r G 5(0), a £ FTV(A) 
a, r G 5(0), act(M) = e, a t FTV(A) 
r G 5(0), act(M) ± e, a i FTV(A) 
t G 5(0), i G {1, 2}, a ^ FTV(A) 

<7, r G 5(0), a^FTV(A) 


Figure 3: Inference Rules of System A 2 


Lemma 4.6 If V is a derivation in Af that types the X-terrn M, and V includes the assertion 
A b N : \fa.T, and there are no subsequent assertions in V for the subterm N that are derived from 
this assertion, then either N = M or there is a subterm (PN) in M where act(P) e. Rephrased, 
the only proper subterms for which the final derived type may be a M-type are companion arguments. 

Lemma 4.7 results from Lemmas 4.5 and 4.6. 

Lemma 4.7 If V is a derivation in Af that types the X-term M, and V includes the assertion 
A b N : Va.r as a consequence of the GEN rule, then N is a companion argument. 

Lemma 4.8 If in Af there is a derivation V ending with the assertion A \~ M : t, then there is 
a derivation V ending with the same assertion such that if the assertion B h N : a in V is the 
consequence of the INST rule, then N G V, i.e. N is a variable. In other words, we can assume all 
uses of the INST rule occur at the leaves of the derivation (viewing the derivation as a tree). 

We now define the new System AJf’* to formally include the restrictions proven by the previous 
lemmas into a type system. We present the inference rules for AJf’* in Figure 3. As in Af, all 
assertions are required to be within rank 2. 

Theorem 4.9 A\~^ a -^M : t holds if and only if Ah ( - A -..^(M) A : r holds, i.e. every Af typing is 
equivalent to a Af’* typing and vice versa. 

5 0-Reduction and System 

In this section, we define a new notion of reduction and then use it to reduce System Af’* typability 
to an even more restricted type discipline, System AJf’*’ . 

Definition 5.1 ( 0 ) We define 4 notions of reduction denoted # l5 0 2 , # 3 , and # 4 which will transform 
a labelled A-term (M) A in a useful way. These transformations are defined as follows: 

• 0i transforms a subterm of the form (((A 1 x.N)P)Q) to ((A 1 x.NQ)P). 



• 0 2 transforms a subterm ( X 3 x.(X 1 y.N)P ) to (( X 1 v.X 3 x.(N[y:=vx]))(X 3 w.(P[x:=w ]))), where 
v and w are fresh variables. 

• 0 3 transforms a subterm of the form (N((X 1 x.P)Q)) to ((A x x.NP)Q). 

• 0 4 transforms a subterm of the form (( X 1 x.(X 2 y.N))P ) to ( X 2 y.((X 1 x.N)P )). 

Capture of free variables in 0 1 , 0 3 , and 0 4 does not occur due to our assumption that all bound 
variables are named distinctly from all free variables. 0 l5 0 3 , and 0 4 affect subterms that are 
applications, while 0 2 is applied to subterms that are abstractions. When A-terms are viewed as 
trees, 0 l5 0 2 , and 0 3 can be seen to have the effect of hoisting (3 -redexes higher in the transformed 
term, while 0 4 has the effect of raising an abstraction above a [3- redex. In section 6, we will 
use properties of these transformations to prove that a typability problem is reducible to acyclic 
semi-unification. 

We use the notation Oi where i £ {1,2, 3,4} to stand for one of 0 l5 0 2 , 0 3 , or 0 4 . We define Oij 
to be Oi U 6j and define 6 = 01,2,3,4. Since these transformations are all notions of reduction, ~^ 01 , 
-^ 12 , —etc., have the expected meaning. 

We say that a term is in 6-normal form if it has no 0-redexes. A 0-normal form of M is a 
A-term N in 0-normal form such that M N. A A-term may have more than one 0-normal 
form, e.g. the A-term (ffXx .M)N)(fXy.P)Q)) has two 0-normal forms, (fXx .(Xy .M P)Q)N) and 
((Xy.(Xx.MP)N)Q). 

We now prove a variety of useful properties of 0-reduction. 

Lemma 5.2 Let M be in 9-normal form. M is of the form: 

X 2 Xl .X 2 x 2 ... .X 2 x m .(X 1 y 1 .(X 1 y 2 .(.. .((AA/ n .T n+1 )T n ).. .))T 2 )T 1 

where m > 0, n > 0, anrl where T 1; ..., T n+1 are X-terms in (3-normal form. Furthermore, any 
abtractions within T; for 1 < i < n + 1 are X 3 -labelled. Thus, all X 1 -labelled abstractions belong to 
/3-redexes, i.e. there are no non-adjacent companions. 

The A-term M detailed in Lemma 5.2 can also be viewed as the following ML term: 

fn x 4 =y fn x 2 =>■ ... =>■ fn x m =y let y 4 = T 4 in let y 2 = T 2 in .. .let y n = T n in S 

Lemma 5.3 0 1; 0 2 , 0 3 , and 0 4 always transform a X-term M into a (3-equivalent X-term N, i.e. if 
M ~^ 0 N, then M = p N. 

Lemma 5.4 9-reduction always terminates, i.e. it is strongly normalizing. 

Lemma 5.5 We can assume that the type assigned to the bound variable of a X 1 -abstraction which 
is the function of a (3-redex has no free type variables that are not also free somewhere else in the 
type assignment. 

Lemma 5.5 is used by Lemma 5.6. 

Lemma 5.6 If 9 4 , 0 2 , 0 3 , or 0 4 transform M into N in one step, then with any particular type 
assignment, both M and N are typable with the same types in Af'*. In other words, if M N, 
then in Af’* it holds that A\~ M : i r is derivable if and only if A \~ N : i r is derivable. As a result, 
A\-( A -,,^M : t is true if and only if A\-^ A -,*^9-nf(M) : r is true. 

Lemma 5.7 act(9-nf((M ) x )) = act(M). 
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A(x) A° r, r G 5(0) 


VAR 0 Ahx :t 


APP 0 

LET 0 


ABS 0 - 1 


ABS 0 - 2 


ABS 0 - 3 


A h M : a^r, A h N : a 
Ah (MN):t 

AU{x:V.a}h M :t, Ah N : a 
Ah ((A 1 x.M)N) : r 
iU{rV.a) h M : t 
A h (X 1 x.M) : (V.<r)->r 
4U{rl} h M : r 
A h (A 2 x.M) : T^r 
AU{i:ff}hM : r 

AF(AT)rMyT^7 


cr, r G 5(0), act(M) = e 
< 7 , t G 5(0) 
r G 5(0) 


a, t e 5(o) 


Figure 4: Inference Rules of System A 2 


Lemma 5.8 ( From [KT92].) In A 2 if A h M : p is derivable and \act(M)\ = ra, 5*en p = 
Va-cy—► .. .—?a n —?T and a G 5(1) and t G 5(0). 

Lemma 5.9 IFe can always assign the type T = \fa.a to the bound variable of a companionless, 
X 2 -labelled abstraction without affecting the whole X-term’s typability. 

Lemma 5.10 Under the restriction that the outermost type assignment assigns the type T to all 
variables, we can always assign universally closed types to the bound variables of every X 1 -labelled 
abstraction without affecting the whole X-term’s typability. 

We now define System Af’*’ e to take advantage of the typing properties of A-terms in 0-normal 
form in AJf’*. System Af’*’ e is intended to be used only for 0-normal forms; its behavior on other 
A-terms has not been investigated. The inference rules for A^"’*’ e are presented in Figure 4. As with 
Af’*, assigned types must be in 5(1) and derived types must be in 5(2). 

Theorem 5.11 Typability and type inference in Af’* are reducible to the same problems in Af’*’ e . 

For a labelled X-term M where \act(M)\ = n, if Ah, K -,.sM : V oi.u ►- >cr n —fT holds, then using 

the type assignment B that maps all variables to type T it is the case that Bh, A -,*,e.Q-nf(M) : 
_L^ • • -^T^r holds as well. If Ch^ A -,*,e^0-nf(M) : p holds, then Ch^ A -,,^M : p must hold as well. 

6 System Type Inference Reducible to ASUP 

In this section we define the problem of acyclic semi-unification, give an algorithm for solving 
it, and develop a construction for reducing the problem of typability in System Af’*’ e to acyclic 
semi-unification. 

For convenience, we define semi-unification using the set of open types R( 0) as the set of 
algebraic terms T. Let A = V denote the set of term variables to emphasize their use in algebraic 
terms as opposed to types. Although the members of T are also types, we will refer to them as 
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terms when using them in semi-unification. A substitution is a function S : X —► T that differs 
from the identity on only finitely many variables. Every substitution extends in a natural way to 
a -^-homomorphism S : T —► T so that An instance T of semi-unification 

is a finite set of pairs (called inequalities) in T X T. Each such pair is written as t < p where 
r,fi E T. A substitution S is a solution of instance T = < pi,... , r n < p n } if and only if there 

exist substitutions f? l5 ... ,R n such that: 


RfiSin j) = S(p i) , ... , if n (5(r n )) = S(p n ) 

For an arbitrary term r, we dehne the left and right subterms of r, denoted L(t) and R{t). More 
precisely, if r is a variable then L(t) and R(t) are undefined, otherwise we set E(r 1 ^r 2 ) = r 1 and 
R(t 1 —^t‘ 2 ) = r 2 . If II G {L, R}*, say II = x^x 2 ■ ■ ■ x p , the notation II(r) means xfixfi- ■ ■ (x p (t) •••)). 
For an arbitrary II E {L,R}*, the subterm II(r) is defined provided II (read from right to left) is 
a path (from the root to an internal node or to a leaf node) in the binary tree representation of r. 

An instance T of semi-unification is acyclic if it can be organized as follows. There are n + 1 
disjoint sets of variables, V 0 , .. .,V n , for some n > 1 , such that the inequalities of T can be placed 
into n columns: 

T 1,1 < fi 1,1 t 2 ’ 1 < ffi 2 ’ 1 . r "’ 1 < fi 1 ’ 1 

T 1,2 < jl 1 ' 2 T 2,2 < fi 2,2 . T n ’ 2 < fi n ’ 2 


r 1>ri < fi 1 ’ ri T 2 ’ r2 < /J , 2 ’ r2 . T n ' rn < fi n ’ rn 

where: 

V 0 = FV (r 1,1 ) U ••• U FV (r 1,ri ) 

V = FV(p 1,1 ) U ••• U FV{jj 1 ' ri ) U FV(t 2 ’ v ) U ••• U EE(r 2 - r2 ) 

E n _i = FVijF- 1 ’ 1 ) U ••• U FV{jj n ~ 1 ’ rn - 1 ) U FV (r"’ 1 ) U ••• U FV(T n ’ rn ) 

V n = FViyF' 1 ) U ••• U FV(ji n ’ r -) 

The acyclic semi-unification problem (henceforth abbreviated ASUP) is the problem of deciding, 
for an ASUP instance T, whether T has a solution. 

We now dehne a procedure which constructs a solution for ASUP instance T if T has a solution 
and otherwise answers that there is no solution. This procedure is a modification of the procedure 
defined in [KTU93] which consists of repeatedly reducing redexes, which can be of two kinds, and it 
halts when there are no more redexes or when a conflict is detected that precludes a solution. Each 
reduction substitutes a term for a variable throughout T and the composition of the reductions 
done so far represents the construction of the solution. 

• (Redex I reduction) Let £ E X and let r' ^ I be a term with the property that there is a 
path II E {L, R}* and t < p is an inequality of T such that: 

II(r) = t’ and n(/r) = £ 

The pair of terms (£,T(r')) where F is a one-to-one substitution that maps all variables in 
r' to fresh names is called a redex I. Reducing this redex substitutes T(r') for all occurrences 
of £ throughout T. 
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• (Redex II reduction ) Let £ E X and pi E T have the property that £ p! and there are paths 

II, A, S E {T, R}* and t < p is an inequality in T such that: 

II(r) = A(r) E X and Sn(/r) = £ and SA(/i) = p' 

Such a pair (£, p r ) is called a redex II. Reducing this redex consists of substituting p! for all 
occurrences of £ throughout T. However, if there is a path 0 E {L,R}* such that Q(p') = £, 
then no solution to T is possible, so the procedure halts and outputs the answer that there is 
no solution if this is detected. 

Although the general case of semi-unification was proven to be undecidable in [KTU93], we 
have the following result for ASUP: 

Lemma 6.1 For an instance T of ASUP, the redex procedure either constructs a solution S to T 
and halts or correctly answers that T has no solution and halts. 

To solve the typability and type inference problems for Af’*’° for A-terms in 0-normal form, we 
construct an ASUP instance T. Consider the labelled A-term M in 0-normal form: 

M = X 2 Xl .X 2 x 2 .. ..X 2 x m .(X 1 y 1 .(X 1 y 2 .(... ((AU/ n .T n+1 )T n ). . .))T 2 )T 1 

We will adopt the convention that the abstractions in the component R bind variables named z itl , 
z i<2 , etc. By writing the inequality (r < 8 - p), we assert that the inequality will belong to column i 
of T. Most of the inequalities will be of a certain special form, so (r =• ^) denotes the inequality 
(a—?a <i r—?p) where ci is a fresh variable mentioned in no other term in T. This will have the 
effect of unifying r and p as in ordinary first-order unification. We will assume that the subterms 
of M are indexed so that two otherwise identical subterms in different positions within M will be 
considered distinct in what follows. 

We construct T as follows. In constructing the instance T of ASUP, each subterm N C R will 
contribute one inequality, each (3 -redex ((A 1 y i .P i )T i ) will contribute one inequality, and for each 
variable y t there will be 1 + n — i inequalities. For each subterm N of R, the term variable S N 
will represent the derived type of N. For each bound variable z it j (which must be monomorphic), 
the term variable 7will represent its assigned type. For each bound variable yi (which must 
be universally polymorphic), the term variables ..., /3 n y will represent its assigned type. For 
each occurrence of Xj (which will be assigned the type _l_), there will be no particular variable to 
represent its type, since it is unconstrained. 

Now we define the inequalities that will be in T. For each subterm N of R, we add an inequality 
to T that will depend on N: 

1. For N = Xj, we do not add any inequality. 

2. For N = yj, we add {fii-iy <; S N ). 

3. For N = Zij, we add (7^ =• S N ). 

4. For N = ( PQ ), we add (S P =* 6 Q ->6 N ). 

5. For N = ( X 3 z it j.P ), we add (7 i,j—*hp =i h N ). 

For each /3-redex ((A^-.P^Tj), we add the inequality =• S Tt ). Finally, for each bound variable 
yj and for each i E {j + 1,..., n + 1}, we add the inequality {fii-iy < Ay)- 

Theorem 6.2 Phe ASUP instance T has a solution S if and only if the X-term M in 9-normal 
form is typable in Af’*’ e . Furthermore, if M is typable in Af’*’ e , the type _L^ • • • ^T^(N(^ t „ +1 )) 
where the number of “L” components of the type is m (the number of variables in the sequence x x , 

..., x m , also the value of \act(M)\) is a type derivable for M in Af’*’ e . 
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We can finally descripe our type inference algorithm for System A 2 . If M is typable in A 2 , then 
the following procedure will produce a type for it and will otherwise answer that M is not typable: 

1. Compute the labelled M 1 = (M) A . 

2. Compute the A-term M 2 = 0-nf (Mi) using ^-reduction. 

3. Compute the ASUP instance P. 

4. Run the redex procedure on P to either produce a solution S for P or the answer that P has 
no solution. In the latter case, halt with the answer that M is not typable in A 2 . 

5. Compute and output the type _L—► • • • —►!—►(,5'(£ Tii+1 )) where the number of “A” components 
is \act(M)\. 

It was shown in [KT92] that A 2 typability is DEXPTIME-complete (where DEXPTIME means 
DTIME(2 n0<1) )). We have just developed an algorithm that reduces A 2 type inference to ASUP 
in polynomial-time. ASUP was shown to be DEXPTIME-complete in [KTU90]. Therefore, our 
algorithm is optimal. 
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